For a broad class of nonlinear regression models we investigate
the local E- and c-optimal design problem. It is demonstrated that
in many cases the optimal designs with respect to these optimality
criteria are supported at the Chebyshev points, which are the local
extrema of the equi-oscillating best approximation of the function
$f_0 \equiv 0$ by a normalized linear combination of the regression
functions in the corresponding linearized model. The class of models
includes rational, logistic and exponential models and for the
rational regression models the E- and c-optimal design problem is
solved explicitly in many cases.


1. Introduction. Nonlinear regression models are widely used to describe
the dependencies between a response and an explanatory variable [see, e.g.,
Seber and Wild (1989), Ratkowsky (1983) or Ratkowsky (1990)]. An
appropriate choice of the experimental conditions can improve the quality of
statistical inference substantially and, therefore, many authors have
discussed the problem of designing experiments for nonlinear regression
models. We refer to Chernoff (1953) and Melas (1978) for early references and
Ford, Torsney and Wu (1992), He, Studden and Sun (1996) and Dette, Haines
and Imhof (1999) for more recent references on local optimal designs. Because
local optimal designs depend on an initial guess for the unknown parameters,
several authors have proposed alternative design strategies. Bayesian or robust
optimal designs have been discussed by Pronzato and Walter (1985) and
Chaloner and Larntz (1989), among many others [see Chaloner and Verdinelli
(1995) and the references therein]. Other authors propose sequential
methods, which update the information about the unknown parameter
sequentially [see, e.g., Ford and Silvey (1980) and Wu (1985)]. Most of the
literature concentrates on D-optimal designs (independent of the particular
approach), which maximize the determinant of the Fisher information
matrix for the parameters in the model, but much less attention has been paid
to E-optimal designs in nonlinear regression models, which maximize the
minimum eigenvalue of the Fisher information matrix [see Dette and Haines
(1994) or Dette and Wong (1999), who report some results for models with
two parameters].

Because local optimal designs are the basis for all advanced design
strategies, it is the purpose of the present paper to study local E-optimal
designs for a class of nonlinear regression models which can be represented
in the form

(1.1)   $Y = \sum_{i=1}^{s} a_i h_i(t) + \sum_{i=1}^{k} a_{s+i}\,\varphi(t, b_i) + \varepsilon.$


Here $\varphi$ is a known function, the explanatory variable $t$ varies in an
interval $I \subset \mathbb{R}$, $\varepsilon$ denotes a random error with mean
zero and constant variance and $a_1,\dots,a_{s+k}, b_1,\dots,b_k \in \mathbb{R}$
denote the unknown parameters of the model.
The consideration of this type of model was motivated by recent work of
Imhof and Studden (2001), who considered a class of rational models of the
form

(1.2)   $Y = \sum_{i=1}^{s} a_i t^{i-1} + \sum_{i=1}^{k} \frac{a_{s+i}}{t - b_i} + \varepsilon,$


where $t \in I$, $b_i \neq b_j$ ($i \neq j$) and the parameters $b_i \notin I$ are
assumed to be known for all $i = 1,\dots,k$. Note that model (1.2) is in fact
linear, because Imhof and Studden (2001) assumed the $b_i$ to be known. These
models are very popular because they have appealing approximation properties
[see Petrushev and Popov (1987) for some theoretical properties and Dudzinski
and Mykytowycz (1961) and Ratkowsky (1983), page 120, for an application of
this model]. In this paper [in contrast to the work of Imhof and Studden
(2001)] the parameters $b_1,\dots,b_k$ in the model (1.1) are not assumed to be
known, but also have to be estimated from the data. Moreover, the model (1.1)
considered here includes numerous other regression functions. For example, in
environmental and ecological statistics exponential models of the form
$a_1 e^{b_1 t} + a_2 e^{b_2 t}$ are frequently used in toxicokinetic experiments
[see, e.g., Becka and Urfer (1996) or Becka, Bolt and Urfer (1993)] and this
corresponds to the choice $\varphi(t,x) = e^{tx}$ in (1.1). Another popular class
of logarithmic models is obtained from equation (1.1) by the choice
$\varphi(t,x) = \log(t - x)$.

Imhof and Studden (2001) studied E-optimal designs for the model (1.2)
with $s = 1$ under the assumption that the nonlinear parameters $b_1,\dots,b_k$
are known by the experimenter and do not have to be estimated from the
data. In particular, they proved that the support of the E-optimal design for
estimating a subset of the parameters $a_1,\dots,a_{k+1}$ is given by the
Chebyshev points corresponding to the functions
$1, \frac{1}{t-b_1}, \dots, \frac{1}{t-b_k}$ in the model (1.2).
These points are the extremal points of the function
$1 + \sum_{i=1}^{k} \frac{a_i^*}{x - b_i} = p^*(x)$
in the interval $I$, which has the smallest deviation from zero, that is,

(1.3)   $\sup_{x \in I} |p^*(x)| = \min_{a_2,\dots,a_{k+1}} \sup_{x \in I} \left| 1 + \sum_{i=1}^{k} \frac{a_{i+1}}{x - b_i} \right|.$

The universality of this solution is due to the fact that any subsystem of
the regression functions in the model (1.2), which is obtained by deleting
one of the basis functions, forms a weak Chebyshev system on the interval $I$
[see Karlin and Studden (1966) or the discussion in Section 2]. However,
in the case where the parameters $b_1,\dots,b_k$ are unknown and also have to be
estimated from the data, the local optimal design problem for the model (1.2)
is equivalent to an optimal design problem in the linear regression model

(1.4)   $Y = \sum_{i=1}^{s} \beta_i t^{i-1} + \sum_{i=1}^{k} \left( \frac{\beta_{s+2i-1}}{t - b_i} + \frac{\beta_{s+2i}}{(t - b_i)^2} \right) + \varepsilon,$


for which the corresponding regression functions do not satisfy the weak
Chebyshev property mentioned above. Nevertheless, we will prove in this
paper that in cases with $k \geq 2$, where the quantity
$\max_{i \neq j} |b_i - b_j|$ is sufficiently small, local E-optimal designs
and many local c-optimal designs for estimating linear combinations of the
parameters are still supported on Chebyshev points. This fact simplifies the
construction of local E-optimal designs substantially. Moreover, we show that
this result does not depend on the specific form of the model (1.2) and (1.4)
but can be established for the general model (1.1) (or its equivalent
linearized model). Additionally, it can be shown numerically that in many
cases the E-optimal design is, in fact, supported on the Chebyshev points for
all admissible values of the parameters $b_1,\dots,b_k$ ($b_i \neq b_j$;
$i \neq j$). Our approach is based on a study of the limiting behavior of the
information matrix in model (1.1) in the case where all nonlinear parameters
in the model (1.1) tend to the same limit. We show that in this case the local
E-optimal and many local optimal designs for estimating linear combinations of
the coefficients $a_{s+1}, b_1, \dots, a_{s+k}, b_k$ in the model (1.1) have the
same limiting design. This indicates that E-optimal designs in models of
type (1.1) yield precise estimates of the individual coefficients and we will
illustrate this fact in several concrete examples.

It is notable that the results regarding local E- and c-optimal designs in
the regression model (1.1) based on Chebyshev approximation are obtained
under the simplifying assumption that $b_i = x + \delta r_i$ ($i = 1,\dots,k$)
with $r_i \neq r_j$ and $\delta$ sufficiently small. Obviously, every vector
$b = (b_1,\dots,b_k)$ can be represented in this form, but the answer to the
question of whether $\delta$ is sufficiently small for our results to be
applicable depends on the basic function $\varphi$ used in (1.1) and the
vector $b$ itself. However, the theoretical results of this
paper suggest a simple procedure to obtain E- and c-optimal designs for the
model (1.1). We use the designs derived under the simplifying assumption to
obtain candidates for the optimal designs and check the optimality of these
candidates by using equivalence theorems or alternative characterizations.
Moreover, the examples of this paper and additional examples in a technical
report of Dette, Melas and Pepelyshev (2002) indicate that in many cases
the designs obtained under the simplifying assumption yield, in fact, the E-
or c-optimal designs.

The remaining part of the paper is organized as follows. In Section 2 we
introduce the basic concepts and notation, and present some preliminary
results. Section 3 is devoted to an asymptotic analysis of the model (1.1),
which is based on a linear transformation introduced in the Appendix.
Some applications to the rational model (1.2) and its equivalent linear
regression model (1.4) are presented in Section 4, which extend the results
of Imhof and Studden (2001) to the case where the nonlinear parameters
in the model (1.2) are not known and have to be estimated from the data.
Finally, all proofs and technical details are deferred to the Appendix.

2. Preliminary results. Consider the nonlinear regression model (1.1)
and define

(2.1)   $f(t,b) = (f_1(t,b),\dots,f_m(t,b))^T = (h_1(t),\dots,h_s(t),\varphi(t,b_1),\varphi'(t,b_1),\dots,\varphi(t,b_k),\varphi'(t,b_k))^T$

as a vector of $m = s + 2k$ regression functions, where the derivatives of the
function $\varphi$ are taken with respect to the second argument. It is
straightforward to show that the Fisher information for the parameter
$(\beta_1,\dots,\beta_m)^T = \beta$ in the linear regression model

(2.2)   $Y = \beta^T f(t,b) + \varepsilon = \sum_{i=1}^{s} \beta_i h_i(t) + \sum_{i=1}^{k} \big( \beta_{s+2i-1}\,\varphi(t,b_i) + \beta_{s+2i}\,\varphi'(t,b_i) \big) + \varepsilon$

is given by $f(t,b) f^T(t,b)$. Following Kiefer (1974), we call any probability
measure $\xi$ with finite support on the interval $I$ an (approximate) design.
The support points give the locations where observations have to be taken,
while the masses correspond to the relative proportions of total observations
to be taken at the particular points. For a design $\xi$ the information matrix
in model (2.2) is defined by

(2.3)   $M(\xi,b) = \int_I f(t,b) f^T(t,b)\, d\xi(t),$

and a local optimal design maximizes an appropriate function of the
information matrix [see Silvey (1980) or Pukelsheim (1993)]. The dependence on
the parameter $b$ is omitted whenever it is clear from the context. Among the
numerous optimality criteria proposed in the literature, we consider the E- and
c-optimality criteria in this paper. An E-optimal design maximizes the
minimum eigenvalue $\lambda_{\min}(M(\xi,b))$ over the set of all approximate
designs, while for a given vector $c \in \mathbb{R}^m$ a c-optimal design
minimizes the expression $c^T M^{-}(\xi,b) c$, where the minimum is taken over
the set of all designs for which the linear combination $c^T \beta$ is
estimable, that is, $c \in \operatorname{range}(M(\xi,b))$ $\forall b$. A
particular case appears for the choice $c = e_i$, where $e_i \in \mathbb{R}^m$
($i = 1,\dots,m$) is the $i$th unit vector. In this case we call the c-optimal
design optimal for estimating the individual coefficient $\beta_i$.
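The definitions above translate directly into a few lines of numerical code. The following is a minimal sketch, not part of the original paper: the helper names `information_matrix`, `e_criterion` and `c_criterion` are hypothetical, the generalized inverse is taken to be the Moore-Penrose inverse, and the straight-line regression $f(t) = (1, t)^T$ on $[-1, 1]$ is used only as an illustrative test case.

```python
import numpy as np

def information_matrix(f, points, weights):
    """M(xi) = sum_j w_j f(t_j) f(t_j)^T for a design xi with finite support."""
    F = np.array([f(t) for t in points], dtype=float)   # row j holds f(t_j)
    w = np.asarray(weights, dtype=float)
    return F.T @ (w[:, None] * F)

def e_criterion(M):
    """Minimum eigenvalue of M; an E-optimal design maximizes this value."""
    return float(np.linalg.eigvalsh(M)[0])

def c_criterion(M, c):
    """c^T M^- c with the Moore-Penrose inverse (assumes c lies in range(M))."""
    c = np.asarray(c, dtype=float)
    return float(c @ np.linalg.pinv(M) @ c)

# Illustrative assumption: straight-line regression f(t) = (1, t)^T on [-1, 1]
f_line = lambda t: np.array([1.0, t])
M_line = information_matrix(f_line, points=[-1.0, 1.0], weights=[0.5, 0.5])
lam_min = e_criterion(M_line)
var_slope = c_criterion(M_line, [0.0, 1.0])
```

For the two-point design with equal masses at $-1$ and $1$ the information matrix is the identity, so both criteria evaluate to one in this toy case.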

Note that the information matrix in the nonlinear regression model (1.1)
is given by $K_a^{-1} M(\xi,b) K_a^{-1}$, where the matrix
$K_a \in \mathbb{R}^{m \times m}$ is defined by

(2.4)   $K_a = \operatorname{diag}\Big( \underbrace{1,\dots,1}_{s}, \underbrace{1, \tfrac{1}{a_{s+1}}, \dots, 1, \tfrac{1}{a_{s+k}}}_{2k} \Big).$

Consequently, a local optimal design problem in a nonlinear model (1.1)
corresponds to an optimal design problem in model (2.2) for the
transformed vector of parameters $K_a \beta$. For example, the c-optimal design
for the model (1.1) can be obtained from the $\bar c$-optimal design in model
(2.2), where the vector $\bar c$ is given by $\bar c = K_a c$. Similarly, the
local E-optimal design in the nonlinear regression model (1.1) maximizes
$\lambda_{\min}(K_a^{-1} M(\xi,b) K_a^{-1})$, where $M(\xi,b)$ is the information
matrix in the equivalent linear regression model (2.2). For the sake of
transparency we will mainly concentrate on the linearized version (2.2). The
corresponding results in the nonlinear regression model (1.1) will be briefly
mentioned whenever it is necessary.
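The role of $K_a$ can be checked numerically in the simplest case. The sketch below is an illustration under stated assumptions, not the authors' code: it takes the one-term rational model $a/(t-b)$ (so $s = 0$, $k = 1$, $\varphi(t,b) = 1/(t-b)$), uses arbitrary illustrative values for $a$, $b$ and $t$, and compares the gradient of the nonlinear mean function with $K_a^{-1} f(t,b)$ for $K_a = \operatorname{diag}(1, 1/a)$.

```python
import numpy as np

def eta(t, a, b):
    """Nonlinear mean function of the one-term rational model a / (t - b)."""
    return a / (t - b)

def f_linearized(t, b):
    """Regression vector (phi, phi') of model (2.2) with phi(t, b) = 1/(t - b)."""
    return np.array([1.0 / (t - b), 1.0 / (t - b) ** 2])

def numerical_gradient(t, a, b, h=1e-6):
    """Central finite differences of eta with respect to (a, b)."""
    da = (eta(t, a + h, b) - eta(t, a - h, b)) / (2 * h)
    db = (eta(t, a, b + h) - eta(t, a, b - h)) / (2 * h)
    return np.array([da, db])

a, b, t = 2.5, -1.0, 0.7                      # illustrative values only
K_a = np.diag([1.0, 1.0 / a])
grad_exact = np.linalg.solve(K_a, f_linearized(t, b))   # K_a^{-1} f(t, b)
grad_fd = numerical_gradient(t, a, b)
```

Since the Fisher information of the nonlinear model is the outer product of this gradient, agreement of the two vectors is what gives the matrix $K_a^{-1} M(\xi,b) K_a^{-1}$ in this special case.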

A set of functions $f_1,\dots,f_m : I \to \mathbb{R}$ is called a weak Chebyshev
system (on the interval $I$) if there exists an $\varepsilon \in \{-1,1\}$ such
that

(2.5)   $\varepsilon \cdot \begin{vmatrix} f_1(x_1) & \cdots & f_1(x_m) \\ \vdots & \ddots & \vdots \\ f_m(x_1) & \cdots & f_m(x_m) \end{vmatrix} \geq 0$

for all $x_1,\dots,x_m \in I$ with $x_1 < x_2 < \dots < x_m$. If the inequality
in (2.5) is strict, then $\{f_1,\dots,f_m\}$ is called a Chebyshev system. It is
well known [see Karlin and Studden (1966), Theorem II 10.2] that if
$\{f_1,\dots,f_m\}$ is a weak Chebyshev system, then there exists a unique
function

(2.6)   $\sum_{i=1}^{m} c_i^* f_i(t) = c^{*T} f(t)$

with the following properties:

(2.7)   (i) $|c^{*T} f(t)| \leq 1$ $\forall t \in I$,
        (ii) there exist $m$ points $s_1 < \dots < s_m$ such that
             $c^{*T} f(s_i) = (-1)^i$, $i = 1,\dots,m$.

The function $c^{*T} f(t)$ is called a Chebyshev polynomial and the points
$s_1,\dots,s_m$ are called Chebyshev points and need not be unique. They are
unique if $1 \in \operatorname{span}\{f_1,\dots,f_m\}$, $m \geq 1$, and $I$ is a
bounded and closed interval, in which case $s_1 = \min_{x \in I} x$,
$s_m = \max_{x \in I} x$. It is well known [see Studden (1968),
Pukelsheim and Studden (1993), Heiligers (1994) or Imhof and Studden (2001),
among others] that in many cases the E- and c-optimal designs in the linear
regression model

(2.8)   $Y = \beta^T f(t) + \varepsilon$


are supported at the Chebyshev points. For the following discussion assume
that the functions $f_1,\dots,f_m$ generate a Chebyshev system on the interval
$I$ with Chebyshev polynomial $c^{*T} f(t)$ and Chebyshev points
$s_1,\dots,s_m$, define the $m \times m$ matrix $F = (f_i(s_j))_{i,j=1}^{m}$
and consider a vector of weights given by

(2.9)   $w = (w_1,\dots,w_m)^T = \frac{J F^{-1} c^*}{\|c^*\|^2},$

where the matrix $J$ is defined by
$J = \operatorname{diag}\{(-1), 1, \dots, (-1)^m\}$. It is then easy to see that

(2.10)  $\frac{c^*}{\|c^*\|^2} = F J w = \sum_{j=1}^{m} f(s_j)(-1)^j w_j \in \partial\mathcal{R},$

where $\mathcal{R} = \operatorname{conv}(f(I) \cup (-f(I)))$ denotes the Elfving
set [see Elfving (1952)]. Consequently, if all weights in (2.9) are
nonnegative, it follows from Elfving's theorem that the design

(2.11)  $\xi^*_{c^*} = \begin{pmatrix} s_1 & \cdots & s_m \\ w_1 & \cdots & w_m \end{pmatrix}$


is $c^*$-optimal in the regression model (2.8) [see Elfving (1952)], where
$c^* \in \mathbb{R}^m$ denotes the vector of coefficients of the Chebyshev
polynomial defined in the previous paragraph. The following results relate
this design to the E-optimal design.

Lemma 2.1. Assume that $f_1,\dots,f_m$ generate a Chebyshev system on the
interval $I$ such that the Chebyshev points are unique. If the minimum
eigenvalue of the information matrix of an E-optimal design has multiplicity
one, then the design defined by (2.9) and (2.11) is E-optimal in the regression
model (2.8). Moreover, in this case the E-optimal design is unique.

Lemma 2.2. Assume that the functions $f_1,\dots,f_m$ generate a Chebyshev
system on the interval $I$ with Chebyshev polynomial $c^{*T} f(t)$ and let
$\xi^*_{c^*}$ denote the $c^*$-optimal design in the regression model (2.2)
defined by (2.11). Then $c^*$ is an eigenvector of the information matrix
$M(\xi^*_{c^*}, b)$, and if the corresponding eigenvalue
$\lambda = 1/\|c^*\|^2$ is the minimal eigenvalue, then $\xi^*_{c^*}$ is
also E-optimal in the regression model (2.8).
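The weights (2.9) and the eigen-relation of Lemma 2.2 can be checked numerically in a textbook case. The sketch below is an illustration only and rests on assumptions that are not taken from the paper: it uses quadratic regression $f(t) = (1, t, t^2)^T$ on $[-1, 1]$, whose Chebyshev points are $-1, 0, 1$, with $c^* = (1, 0, -2)^T$ chosen so that $c^{*T} f(s_i) = (-1)^i$ as in (2.7).

```python
import numpy as np

def f_quad(t):
    """Regression vector (1, t, t^2)^T of quadratic regression."""
    return np.array([1.0, t, t * t])

s = np.array([-1.0, 0.0, 1.0])            # Chebyshev points on [-1, 1]
c_star = np.array([1.0, 0.0, -2.0])       # c*^T f(t) = 1 - 2 t^2

m = len(s)
F = np.column_stack([f_quad(sj) for sj in s])        # F[i, j] = f_i(s_j)
J = np.diag([(-1.0) ** (i + 1) for i in range(m)])   # diag(-1, 1, ..., (-1)^m)

# Weights (2.9): w = J F^{-1} c* / ||c*||^2
w = J @ np.linalg.solve(F, c_star) / (c_star @ c_star)

# Information matrix of the design (2.11) and the eigenvalue from Lemma 2.2
M = sum(wj * np.outer(f_quad(sj), f_quad(sj)) for sj, wj in zip(s, w))
lam = 1.0 / (c_star @ c_star)
eigenvalues = np.linalg.eigvalsh(M)
```

In this toy case the weights come out as $(1/5, 3/5, 1/5)$, $c^*$ is an eigenvector of $M$ with eigenvalue $1/\|c^*\|^2 = 1/5$, and this is also the smallest eigenvalue, so the hypothesis of Lemma 2.2 holds.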


We now discuss the c-optimal design problem in the regression model
(2.8) for a general vector $c \in \mathbb{R}^m$ (not necessarily equal to the
vector $c^*$ of coefficients of the Chebyshev polynomial). Assume again that
$f_1,\dots,f_m$ generate a Chebyshev system on the interval $I$. As a candidate
for the c-optimal design, we consider the measure

(2.12)  $\xi_c = \xi_c(b) = \begin{pmatrix} s_1 & \cdots & s_m \\ w_1 & \cdots & w_m \end{pmatrix},$

where the support points are the Chebyshev points and the weights are
already chosen such that the expression $c^T M^{-1}(\xi_c, b) c$ becomes
minimal, that is,

(2.13)  $w_i = \frac{|e_i^T J F^{-1} c|}{\sum_{j=1}^{m} |e_j^T J F^{-1} c|}, \quad i = 1,\dots,m,$

where $e_j = (0,\dots,0,1,0,\dots,0)^T \in \mathbb{R}^m$ denotes the $j$th unit
vector [see Kitsos, Titterington and Torsney (1988), Pukelsheim and Torsney
(1991) or Pukelsheim (1993)]. The following result characterizes the optimal
designs for estimating the individual coefficients.
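Formula (2.13) is easy to evaluate once the Chebyshev points are known. As a hedged illustration (the function name `c_optimal_weights` is hypothetical and the quadratic regression example on $[-1,1]$ is an assumption chosen for its known Chebyshev points $-1, 0, 1$), the following sketch computes the weights for $c = e_3$, that is, for estimating the coefficient of $t^2$.

```python
import numpy as np

def c_optimal_weights(F, c):
    """Weights (2.13): w_i = |e_i^T J F^{-1} c| / sum_j |e_j^T J F^{-1} c|."""
    m = F.shape[0]
    J = np.diag([(-1.0) ** (i + 1) for i in range(m)])
    v = np.abs(J @ np.linalg.solve(F, np.asarray(c, dtype=float)))
    return v / v.sum()

s = np.array([-1.0, 0.0, 1.0])                 # Chebyshev points of (1, t, t^2)
F = np.vstack([np.ones_like(s), s, s ** 2])    # F[i, j] = f_i(s_j)
w_e3 = c_optimal_weights(F, [0.0, 0.0, 1.0])
```

Because of the absolute values the matrix $J$ does not change the result; it is kept only to mirror the notation of (2.13). For this example the weights are $(1/4, 1/2, 1/4)$.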


Lemma 2.3. Assume that the functions $f_1,\dots,f_m$ generate a Chebyshev
system on the interval $I$. The design defined by (2.12) and (2.13) for the
vector $c = e_j$ is $e_j$-optimal in the linear regression model (2.8) if the
system $\{f_i \mid i \in \{1,\dots,m\} \setminus \{j\}\}$ is a weak Chebyshev
system on the interval $I$.

If the sufficient conditions of Lemma 2.3 are not satisfied, the
determination of the $e_j$-optimal designs is a substantially harder problem
and optimal designs for estimating individual coefficients have only been found
numerically in rare circumstances [see Studden (1968) or Dette, Melas and
Pepelyshev (2004)]. In many cases the resulting designs yield a singular
information matrix, which makes its determination by standard methods difficult.

Remark 2.4. It is worthwhile to mention that, in general, the sufficient
condition of Lemma 2.3 is not satisfied in the regression model (2.2).
To see this, assume that $k \geq 3$, that the function $\varphi$ is continuously
differentiable with respect to the second argument and that the functions
$f_1(\cdot,b),\dots,f_m(\cdot,b)$ defined by (2.1) generate a Chebyshev system
for any $b$. Define an $(m-1) \times (m-1)$ matrix

$F_j(x) := \big( h_1(t_i),\dots,h_s(t_i),\varphi(t_i,b_1),\varphi'(t_i,b_1),\dots,\varphi(t_i,b_{j-1}),\varphi'(t_i,b_{j-1}),\varphi(t_i,x),\varphi(t_i,b_{j+1}),\varphi'(t_i,b_{j+1}),\dots,\varphi(t_i,b_k),\varphi'(t_i,b_k) \big)_{i=1}^{m-1},$

where $c < t_1 < \dots < t_{m-1} < d$, $b_i \neq b_j$ whenever $i \neq j$ and
$x \neq b_i$. We choose $t_1,\dots,t_{m-1}$ such that
$g(x) = \det F_j(x) \not\equiv 0$ (note that the functions $f_1,\dots,f_m$
form a Chebyshev system and, therefore, this is always possible) and observe
that $g(b_i) = 0$, $i = 1,\dots,k$; $i \neq j$. Because $k \geq 3$ and $g$ is
continuously differentiable, it follows that there exist two points, say $x^*$
and $x^{**}$, such that $g'(x^*) < 0$ and $g'(x^{**}) > 0$. Consequently, there
exists an $\bar x$ such that

$0 = g'(\bar x) = \det \big( f_\nu(t_i, b_{\bar x}) \big)_{i=1,\dots,m-1}^{\nu=1,\dots,s+2j-2,\,s+2j,\dots,m},$

where the vector $b_{\bar x}$ is defined by
$b_{\bar x} = (b_1,\dots,b_{j-1},\bar x,b_{j+1},\dots,b_k)^T$. Note
that the Chebyshev property of the functions
$f_1,\dots,f_{s+2j-2},f_{s+2j},\dots,f_m$ would imply that all determinants
in (2.5) are of the same sign (otherwise there exists a $b$ such that the
determinant vanishes for $t_1 < \dots < t_{m-1}$). Therefore, the conditions
$g'(x^*) < 0$, $g'(x^{**}) > 0$ yield that there exists an
$\tilde x \in (x^*, \bar x)$ or $\tilde x \in (\bar x, x^{**})$, such that the
system of regression functions

$\{ f_1(t,b_{\tilde x}),\dots,f_{s+2j-2}(t,b_{\tilde x}),f_{s+2j}(t,b_{\tilde x}),\dots,f_m(t,b_{\tilde x}) \}$
$= \{ h_1(t),\dots,h_s(t),\varphi(t,b_1),\varphi'(t,b_1),\dots,\varphi'(t,b_{j-1}),\varphi'(t,\tilde x),\varphi(t,b_{j+1}),\varphi'(t,b_{j+1}),\dots,\varphi'(t,b_k) \}$

is not a weak Chebyshev system on the interval $I$. Finally, in the case
$k = 2$, if $\lim_{|b| \to \infty} \varphi(t,b) = 0$, it can be shown by a
similar argument that there exists an $\tilde x$ such that the system

$\{ h_1(t),\dots,h_s(t),\varphi(t,b_1),\varphi'(t,b_1),\varphi'(t,\tilde x) \}$

is not a Chebyshev system on the interval $I$.

3. Asymptotic analysis of E- and c-optimal designs. Recall the definition
of the information matrix in (2.3) for the model (2.2) with design
space given by $I = [c_1, d_1]$ and assume that the nonlinear parameters vary
in a compact interval, say $b_i \in [c_2, d_2]$, $i = 1,\dots,k$. We are
interested in the asymptotic properties of E- and c-optimal designs if

(3.1)   $b_i = x + \delta r_i, \quad i = 1,\dots,k,$

for some fixed $x \in [c_2, d_2]$, fixed $r_1 < r_2 < \dots < r_k$ and positive
$\delta$ satisfying $\delta \to 0$. Note that condition (3.1) implies that all
parameters $b_i$ converge to $x$ at the same rate $\delta$. For the asymptotic
investigations we study for fixed $\varepsilon, \Delta > 0$ the set

(3.2)   $\Omega_{\varepsilon,\Delta} = \big\{ b \in \mathbb{R}^k \mid b_i - b_j = \delta(r_i - r_j);\ i,j = 1,\dots,k;\ \delta \leq \varepsilon;\ b_i \in [c_2,d_2],\ \min_{i \neq j} |r_i - r_j| \geq \Delta \big\},$


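introduce the functions

(3.3)   $\bar f_i(t,x) = \bar f_i(t) = h_i(t), \quad i = 1,\dots,s,$
        $\bar f_{s+i}(t,x) = \bar f_{s+i}(t) = \varphi^{(i-1)}(t,x), \quad i = 1,\dots,2k,$

and the corresponding vector of regression functions

(3.4)   $\bar f(t,x) = (\bar f_1(t,x),\dots,\bar f_{s+2k}(t,x))^T,$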

where the derivatives of $\varphi(t,x)$ are taken with respect to the second
argument. Again the dependency of the functions $\bar f_i$ on the parameter $x$
will be omitted whenever it is clear from the context. Note that for a
sufficiently smooth function $\varphi$, a simple Taylor expansion shows that
under assumption (3.1),

$(\varphi(t,b_1),\varphi'(t,b_1),\dots,\varphi(t,b_k),\varphi'(t,b_k))^T$
$= Q\,(\varphi(t,x),\varphi'(t,x),\dots,\varphi^{(2k-1)}(t,x))^T + o(1)$
$= Q\,(\bar f_{s+1}(t,x),\dots,\bar f_{s+2k}(t,x))^T + o(1)$

for an appropriately defined matrix $Q \in \mathbb{R}^{2k \times 2k}$ (see the
proof of Theorem B.1 in the Appendix). Therefore, optimal designs in the linear
model with vector of regression functions given by (3.4) will serve as an
approximation for the optimal design in model (2.2) if the parameters $b_i$ are
sufficiently close in the sense of (3.1). The following results make this
statement more precise.
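The Taylor step can be made concrete with a small numerical check. The sketch below is an illustration under explicit assumptions: the matrix built by the hypothetical helper `taylor_matrix` is one natural choice of $Q$ obtained from plain Taylor coefficients (the paper defines its own $Q$ in the Appendix, which may be normalized differently), the function $\varphi(t,b) = 1/(t-b)$ of the rational model is used, and the values of $x$, $\delta$, $r$ and $t$ are arbitrary.

```python
import math
import numpy as np

def phi_deriv(t, b, n):
    """n-th derivative of phi(t, b) = 1/(t - b) with respect to b."""
    return math.factorial(n) / (t - b) ** (n + 1)

def taylor_matrix(delta, r):
    """A 2k x 2k matrix of Taylor coefficients around x (one possible Q)."""
    k = len(r)
    Q = np.zeros((2 * k, 2 * k))
    for i, ri in enumerate(r):
        h = delta * ri
        for n in range(2 * k):
            Q[2 * i, n] = h ** n / math.factorial(n)           # row for phi(t, b_i)
            if n >= 1:                                         # row for phi'(t, b_i)
                Q[2 * i + 1, n] = h ** (n - 1) / math.factorial(n - 1)
    return Q

x, delta, r, t = -2.0, 0.01, [-1.0, 1.0], 0.0     # illustrative values only
b = [x + delta * ri for ri in r]                  # assumption (3.1)
lhs = np.array([phi_deriv(t, bi, n) for bi in b for n in (0, 1)])
fbar_phi = np.array([phi_deriv(t, x, n) for n in range(2 * len(r))])
rhs = taylor_matrix(delta, r) @ fbar_phi
max_error = float(np.max(np.abs(lhs - rhs)))
```

With these values the truncated expansion reproduces $\varphi(t,b_i)$ and $\varphi'(t,b_i)$ up to an error of order $\delta^3$, which is the sense in which the model with regression functions (3.4) approximates model (2.2) for small $\delta$.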


Lemma 3.1. Assume that the function
$\varphi : [c_1,d_1] \times [c_2,d_2] \to \mathbb{R}$ in model (1.1) satisfies

$\varphi \in C^{0,2k-1}([c_1,d_1] \times [c_2,d_2])$

and that for any fixed $x \in [c_2,d_2]$, the functions
$\bar f_1,\dots,\bar f_{s+2k}$ defined by (3.3) form a Chebyshev system on the
interval $[c_1,d_1]$. For any $\Delta > 0$ and any design on the interval
$[c_1,d_1]$ with at least $m = s + 2k$ support points, there exists an
$\varepsilon > 0$ such that for all $b \in \Omega_{\varepsilon,\Delta}$, the
maximum eigenvalue of the inverse information matrix $M^{-1}(\xi,b)$ defined
in (2.3) is simple.

Theorem 3.2. Assume that the function
$\varphi : [c_1,d_1] \times [c_2,d_2] \to \mathbb{R}$ in model (1.1) satisfies

$\varphi \in C^{0,2k-1}([c_1,d_1] \times [c_2,d_2])$

and that the systems of functions $\{f_1(t,b),\dots,f_m(t,b)\}$ and
$\{\bar f_1(t,x),\dots,\bar f_m(t,x)\}$ defined by (2.1) and (3.3),
respectively, are Chebyshev systems on the interval $[c_1,d_1]$ (for arbitrary
but fixed $b_1,\dots,b_k, x \in [c_2,d_2]$ with $b_i \neq b_j$ whenever
$i \neq j$). If $\varepsilon$ is sufficiently small, then for any
$b \in \Omega_{\varepsilon,\Delta}$ the design $\xi^*_{c^*}$ defined by (2.9)
and (2.11) is the unique E-optimal design in the regression model (2.2).

Note that for $b \in \Omega_{\varepsilon,\Delta}$, E-optimal designs can be
obtained explicitly by Lemmas 2.1 and 2.2. The support points are the extremal
points of the Chebyshev polynomial corresponding to the functions in (2.1),
while the weights are given by (2.9).

From Remark 2.4 we may expect that, in general, c-optimal designs in
the regression model (1.1) are not necessarily supported at the Chebyshev
points. Nevertheless, an analogue of Lemma 3.1 is available for specific
vectors $c \in \mathbb{R}^m$. The proof is similar to the proof of Lemma 3.1
and, therefore, omitted.

Lemma 3.3. Let $e_i = (0,\dots,0,1,0,\dots,0)^T$ denote the $i$th unit vector
in $\mathbb{R}^m$. Under the assumptions of Lemma 3.1 define a vector
$\bar\gamma = (0,\dots,0,\gamma_1,\dots,\gamma_{2k})^T \in \mathbb{R}^m$ by

2 

(3.5) = 72i-i = -72iX!^—’ i = l,...,k. 

(i) If $c \in \mathbb{R}^m$ satisfies $c^T \bar\gamma \neq 0$, then for any
$\Delta > 0$, sufficiently small $\varepsilon$ and any
$b \in \Omega_{\varepsilon,\Delta}$, the design $\xi_c(b)$ defined in (2.12)
and (2.13) is c-optimal in the regression model (2.2).

(ii) The assumption $c^T \bar\gamma \neq 0$ is, in particular, satisfied for
the vector $c = e_{s+2j-1}$ for any $j = 1,\dots,k$ and for the vector
$c = e_{s+2j}$ for any $j = 1,\dots,k$, which satisfies the condition

(3.6) 

Remark 3.4. As pointed out by a referee, some explanation of the set
$\Omega_{\varepsilon,\Delta}$ is helpful at this point.

Note that the quantity $\Delta \leq \min_{i \neq j} |r_i - r_j|$ yields some
mild restriction for the $r_i$ in (3.1) and $\varepsilon$ can be considered as
a cut-off point, such that whenever $\delta < \varepsilon$ in (3.1), the
statements of Theorem 3.2 and Lemmas 3.1 and 3.3 apply to the corresponding
vector $b \in \Omega_{\varepsilon,\Delta}$. This cut-off point cannot be
determined explicitly because it depends in a complicated way on $\Delta$, the
intervals $[c_1,d_1]$, $[c_2,d_2]$ and the basic function $\varphi(t,x)$ used
in the regression model (1.1). Roughly speaking, the results of Lemmas 3.1 and
3.3 and Theorem 3.2 hold for any vector $b$ in a compact neighborhood of the
vector $(x,\dots,x) \in \mathbb{R}^k$. In the examples for the rational model
discussed in Section 4 the set $\Omega_{\varepsilon,\Delta}$ coincides with the
set of all admissible values for the parameter $b$.

Note also that it follows from the proof of Lemma 3.1 that the assumption
of compactness of the intervals $[c_1,d_1]$ and $[c_2,d_2]$ is only required
for the existence of the set $\Omega_{\varepsilon,\Delta}$. In other words, if
condition (3.1) is satisfied and $\delta$ is sufficiently small, the maximum
eigenvalue of the matrix $M^{-1}(\xi,b)$ will have multiplicity one
(independently of the domain of the function $\varphi$). The
same remark applies to the statements of Theorem 3.2 and Lemma 3.3.


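Our final result of this section shows that under assumption (3.1) with
small $\delta$, the local E- and local c-optimal designs for the vectors $c$
considered in Lemma 3.3 and Remark 3.4 are very close. To be precise, we assume
that the assumptions of Theorem 3.2 are valid and consider the design

(3.7)   $\bar\xi_c = \bar\xi_c(x) = \begin{pmatrix} \bar s_1 & \cdots & \bar s_m \\ \bar w_1 & \cdots & \bar w_m \end{pmatrix},$

where $\bar s_1,\dots,\bar s_m$ are the Chebyshev points corresponding to the
system $\{\bar f_i \mid i = 1,\dots,m\}$ defined in (3.3),

(3.8)   $\bar w_i = \frac{|e_i^T J \bar F^{-1} c|}{\sum_{j=1}^{m} |e_j^T J \bar F^{-1} c|}, \quad i = 1,\dots,m,$

with $\bar F = (\bar f_i(\bar s_j))_{i,j=1}^{m}$ and $c \in \mathbb{R}^m$ a
fixed vector.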


Theorem 3.5. Assume that the assumptions of Theorem 3.2 are satisfied and
that for the system $\{\bar f_1,\dots,\bar f_m\}$ the Chebyshev points are
unique.

(i) If $\delta \to 0$, the design $\xi^*_{c^*}(b)$ defined by (2.11) and (2.9)
converges weakly to the design $\bar\xi_{e_m}(x)$ defined by (3.7) and (3.8)
for $c = e_m$.

(ii) If $c \in \mathbb{R}^m$ satisfies $c^T \bar\gamma \neq 0$ for the vector
$\bar\gamma$ with components defined in (3.5) and $\delta \to 0$, then the
design $\xi_c(b)$ defined by (2.12) and (2.13) converges weakly to the design
$\bar\xi_{e_m}(x)$.

(iii) The assumption $c^T \bar\gamma \neq 0$ is, in particular, satisfied for
the vector $c = e_{s+2j-1}$ for any $j = 1,\dots,k$ and for the vector
$c = e_{s+2j}$ for any $j = 1,\dots,k$, which satisfies condition (3.6).


Remark 3.6. Note that Theorem 3.2, Lemma 3.3 and Theorem 3.5 remain
valid for the local optimal designs in the nonlinear regression model (1.1).
This follows by a careful inspection of the proofs of the previous results. For
example, there exists a set $\Omega_{\varepsilon,\Delta}$ such that for all
$b \in \Omega_{\varepsilon,\Delta}$ the maximum eigenvalue of the inverse
information matrix in the model (1.1) is simple. Similarly, if
$\delta \to 0$ and (3.1) is satisfied, c-optimal designs in the nonlinear
regression model are given by the design $\xi_{\bar c}(b)$ in (2.12) and (2.13)
with $\bar c = K_a c$, whenever $\bar\gamma^T \bar c \neq 0$, and all these
designs converge weakly to the $e_m$-optimal design in the linear regression
model defined by the functions (3.4).

We finally remark that Theorem 3.5 and Remark 3.6 indicate that E-optimal
designs are very efficient for estimating the parameters
$a_{s+1}, b_1, \dots, a_{s+k}, b_k$ in the nonlinear regression model (1.1) and
the linear model (2.2), because for small differences $|b_i - b_j|$ the
E-optimal design and the optimal design for estimating an individual
coefficient $b_i$ ($i = 1,\dots,k$) are close to the optimal design for
estimating the coefficient $b_k$. We will illustrate this fact in the
following section, which discusses the rational model in more detail.

4. Rational models. In this section we discuss the rational model (1.2)
in more detail, where the design space is a compact or semi-infinite
interval $I$.

In contrast to the work of Imhof and Studden (2001), we assume that the
nonlinear parameters $b_1,\dots,b_k \notin I$ are not known by the experimenter
but have to be estimated from the data. A typical application of this model
can be found in the work of Dudzinski and Mykytowycz (1961), where this
model was used to describe the relation between the weight of the dried eye
lens of the European rabbit and the age of the animal. In the notation of
Sections 2 and 3 we have $f(t) = f(t,b) = (f_1(t),\dots,f_m(t))^T$, with

(4.1)   $f_i(t) = t^{i-1}, \quad i = 1,\dots,s,$
        $f_{s+2i-1}(t) = f_{s+2i-1}(t,b) = \frac{1}{t - b_i},$
        $f_{s+2i}(t) = f_{s+2i}(t,b) = \frac{1}{(t - b_i)^2}, \quad i = 1,\dots,k,$

and the equivalent linear regression model is given by (1.4). The
corresponding limiting model is determined by the regression functions
$\bar f(t) = \bar f(t,x) = (\bar f_1(t),\dots,\bar f_m(t))^T$, with

(4.2)   $\bar f_i(t) = t^{i-1}, \quad i = 1,\dots,s,$
        $\bar f_{i+s}(t) = \bar f_{s+i}(t,x) = \frac{1}{(t - x)^i}, \quad i = 1,\dots,2k.$

Some properties of the functions defined by (4.1) and (4.2) are discussed in
the following lemma.

Lemma 4.1. Define
$B = \{ b = (b_1,\dots,b_k)^T \in \mathbb{R}^k \mid b_i \notin I;\ b_i \neq b_j \}$.
Then the following assertions are true:

(i) If $I$ is a finite interval or $I \subset [0,\infty)$ and $b \in B$, then
the system $\{f_1(t,b),\dots,f_m(t,b)\}$ defined in (4.1) is a Chebyshev system
on the interval $I$. If $x \notin I$, then the system
$\{\bar f_1(t,x),\dots,\bar f_m(t,x)\}$ defined by (4.2) is a
Chebyshev system on the interval $I$.

(ii) Assume that $b \in B$ and that one of the following conditions is
satisfied:

(a) $I \subset [0,\infty)$,

(b) $s = 1$ or $s = 0$.

For any $j \in \{1,\dots,k\}$, the system of regression functions
$\{f_i(t,b) \mid i = 1,\dots,m,\ i \neq s+2j\}$ is a Chebyshev system on the
interval $I$.

(iii) If $I$ is a finite interval or $I \subset [0,\infty)$, $k \geq 2$, and
$j \in \{1,\dots,k\}$, then there exists a nonempty set $W_j \subset B$ such
that for all $b \in W_j$, the system of functions
$\{f_i(t,b) \mid i = 1,\dots,m;\ i \neq s+2j-1\}$ is not a Chebyshev system on
the interval $I$.
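Part (i) of the lemma can be probed numerically through the determinant condition (2.5). The following sketch is only a sanity check under illustrative assumptions, not a proof: it takes $s = 1$, $k = 1$, the interval $I = [-1,1]$ and the arbitrary pole $b = 2 \notin I$, and records the sign of the determinant in (2.5) for all increasing triples from a coarse grid.

```python
import itertools
import numpy as np

def f_rational(t, b):
    """System (4.1) for s = 1, k = 1: (1, 1/(t - b), 1/(t - b)^2)."""
    return np.array([1.0, 1.0 / (t - b), 1.0 / (t - b) ** 2])

b = 2.0                                   # illustrative pole outside I = [-1, 1]
grid = np.linspace(-1.0, 1.0, 9)
signs = set()
for pts in itertools.combinations(grid, 3):      # each tuple is increasing
    F = np.column_stack([f_rational(t, b) for t in pts])   # F[i, j] = f_i(x_j)
    signs.add(int(np.sign(np.linalg.det(F))))
```

A single nonzero sign over all triples is consistent with the strict inequality in (2.5), as expected here because the three functions are $1, u, u^2$ in the monotone variable $u = 1/(t-b)$.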

The case $k = 1$ will be studied more explicitly in Example 4.5. Note that
the third part of Lemma 4.1 shows that for $k \geq 2$, the main condition of
Theorem 2.1 in the paper of Imhof and Studden (2001) is not satisfied in
general for the linear regression model with the functions given by (4.1).
These authors assumed that every subsystem of $\{f_1,\dots,f_m\}$ which
consists of $m - 1$ of these functions is a weak Chebyshev system on the
interval $I$. Because the design problem for this model is equivalent to the
design problem for the model (1.2) (where the nonlinear parameters are not
known and have to be estimated), it follows that, in general, we cannot expect
local E-optimal designs for the rational model to be supported at the Chebyshev
points. However, the linearized regression model (1.4) is a special case of the
general model (2.2) with $\varphi(t,b) = (t - b)^{-1}$ and all results of
Section 3 are applicable here. In particular, we obtain that the E-optimal
designs and the optimal designs for estimating the individual coefficients
$a_{s+1}, b_1, \dots, a_{s+k}, b_k$ are supported at the Chebyshev points if the
nonlinear parameters $b_1,\dots,b_k$ are sufficiently close (see Theorem 3.2,
Lemma 3.3 and Remark 3.6).

Theorem 4.2. (i) If $s = 1$, then the Chebyshev points
$s_1 = s_1(b),\dots,s_m = s_m(b)$ for the system of regression functions
in (4.1) on the interval $[-1,1]$ are given by the zeros of the polynomial

(4.3)   $(1 - t^2) \sum_{i=0}^{4k} d_i\, U_{-2k+s+i-2}(t),$

where $U_j(x)$ denotes the $j$th Chebyshev polynomial of the second kind [see
Szegö (1975)], $U_{-1}(x) = 0$, $U_{-n}(x) = -U_{n-2}(x)$ and the factors
$d_0,\dots,d_{4k}$ are defined as the coefficients of the polynomial

(4.4)   $\sum_{i=0}^{4k} d_i t^i = \prod_{i=1}^{k} (t - \tau_i)^4,$

where

$2 b_i = \tau_i + \frac{1}{\tau_i}, \quad i = 1,\dots,k.$

(ii) Let $\Omega_E \subset B$ denote the set of all $b$ such that an E-optimal
design for the model (1.4) is given by (2.11) and (2.9). Then
$\Omega_E \neq \emptyset$.
Remark 4.3. (a) The Chebyshev points for the system (4.1) on an
arbitrary finite interval $I \subset \mathbb{R}$ can be obtained by rescaling
the points onto the interval $[-1,1]$. The case $s = 0$ and $I = [0,\infty)$
will be discussed in more detail in Examples 4.5 and 4.6.

(b) It follows from Theorem 3.2 that the set $\Omega_E$ defined in the second
part of Theorem 4.2 contains the set $\Omega_{\varepsilon,\Delta}$ defined
in (3.2) for sufficiently small $\varepsilon$. In other words, if the nonlinear
parameters $b_1,\dots,b_k$ are sufficiently close, the local E-optimal design
will be supported at the Chebyshev points with weights given by (2.9).
Moreover, we will demonstrate in the subsequent examples that in many cases the
set $\Omega_E$ coincides with the full set $B$.

(c) In applications the Chebyshev points can be calculated numerically
with the Remez algorithm [see Studden and Tsay (1976) or DeVore and Lorentz
(1993)]. In some cases these points can be obtained explicitly.
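One explicit case can be verified with a short numerical check. The sketch below is an illustration under stated assumptions and does not implement the Remez algorithm: it uses the two functions $1/(t-b)$ and $1/(t-b)^2$ on $[0,\infty)$ with $b < 0$ (the setting of Example 4.5, where the Chebyshev points are stated to be $0$ and $\sqrt{2}\,|b|$). The coefficient vector `c_star` was derived for this sketch from the substitution $u = 1/(t-b)$ and is not quoted from the paper; the value $b = -1$ and the truncated grid are arbitrary.

```python
import numpy as np

b = -1.0                                       # illustrative value, b < 0
a = (1.0 + np.sqrt(2.0)) * abs(b)
c_star = np.array([2.0 * a, -a ** 2])          # assumption: derived for this sketch

def cheb_poly(t):
    """p(t) = c1/(t - b) + c2/(t - b)^2 for the coefficients above."""
    u = 1.0 / (np.asarray(t, dtype=float) - b)
    return c_star[0] * u + c_star[1] * u ** 2

s1, s2 = 0.0, np.sqrt(2.0) * abs(b)            # Chebyshev points from Example 4.5
grid = np.linspace(0.0, 50.0, 5001)            # finite grid standing in for [0, inf)
values = cheb_poly(grid)
sup_norm = float(np.max(np.abs(values)))
t_max = float(grid[np.argmax(values)])
```

The function takes the values $-1$ and $+1$ at the two points, as required by (2.7), stays within $[-1,1]$ on the grid, and attains its maximum near $\sqrt{2}\,|b|$.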

Remark 4.4. We note that a similar result is valid for c-optimal designs
in the rational regression model (1.4). For example, assume that one of the
assertions of Lemma 4.1 is valid and that we are interested in estimating a
linear combination $c^T \beta$ of the parameters in the rational model (1.4).
We obtain from Lemma 3.3 that if $c \in \mathbb{R}^m$ satisfies
$c^T \bar\gamma \neq 0$, then for sufficiently small $\varepsilon$ and any
$b \in \Omega_{\varepsilon,\Delta}$ the design $\xi_c(b)$ defined in (2.12) and
(2.13) is c-optimal. In particular, this is true for $c = e_{s+2j-1}$ (for all
$j = 1,\dots,k$) and the vector $c = e_{s+2j}$ if the index $j$ satisfies the
condition (3.6). Note that due to the third part of Lemma 4.1 in the case
$k \geq 2$, there exists a $b \in B$ such that the $e_{s+2j}$-optimal design is
not necessarily supported at the Chebyshev points. However, from Theorem 3.5 it
follows that for a vector $b \in B$ satisfying (3.1) with $\delta \to 0$ and
any vector $c$ with $c^T \bar\gamma \neq 0$, the designs $\xi^*_{c^*}(b)$ and
$\xi_c(b)$ defined by (2.11) and (2.12) both converge weakly to the design
$\bar\xi_{e_m}(x)$, where the design $\bar\xi_{e_m}(x)$ is defined in (3.7)
and (3.8) and is $e_m$-optimal in the limiting model with the regression
functions (4.2). We conclude this section with two examples. Further examples
considering a finite interval as design space and a comparison with D-optimal
designs can be found in the technical report of Dette, Melas and Pepelyshev
(2002).

Example 4.5. Consider the rational model

(4.5)   $Y = \dfrac{a}{t-b} + \varepsilon, \qquad t \in [0,\infty),$

with $b < 0$ (here we have $k = 1$, $s = 0$, $I = [0,\infty)$). The corresponding
equivalent linear regression model is given by

(4.6)   $Y = \beta^T f(t,b) + \varepsilon = \dfrac{\beta_1}{t-b} + \dfrac{\beta_2}{(t-b)^2} + \varepsilon.$

In this case it follows from the first part of Lemma 4.1 that the system
of regression functions $\{\frac{1}{t-b}, \frac{1}{(t-b)^2}\} = \{f_1(t,b), f_2(t,b)\}$ is a Chebyshev system
on the interval $[0,\infty)$ whenever $b < 0$. Moreover, any subsystem (consisting
of one function) is obviously a Chebyshev system on the interval $[0,\infty)$.
The Chebyshev points are given by $s_1 = 0$ and $s_2 = \sqrt{2}|b| = -\sqrt{2}b$. Now
we consider the design $\xi_c^*(b)$ defined in (2.12) as a candidate for the
c-optimal design in model (4.6). The weights (for any $c \in \mathbb{R}^2$) are obtained
from formula (2.13) and a straightforward calculation shows that the
c-optimal design $\xi_c^*(b)$ has masses $\omega_1$ and $1 - \omega_1$ at the points $0$ and $\sqrt{2}|b|$,
respectively, where

$\omega_1 = \dfrac{|b(\sqrt{2}c_1 + (2+\sqrt{2})c_2 b)|}{|b|\{|\sqrt{2}c_1 + (2+\sqrt{2})c_2 b| + (4+3\sqrt{2})|c_1 + c_2 b|\}}.$

It can easily be checked by Elfving’s theorem [see Elfving (1952)] or by the 
equivalence theorem for c-optimality [see Pukelsheim (1993)] that this design 
is, in fact, c-optimal in the regression model (4.6) whenever 

In the remaining cases the c-optimal design is a one-point design supported
at $t = b + c_1/c_2$. In particular, by Lemma 2.3, the $e_1$- and $e_2$-optimal designs
for estimating the coefficients $\beta_1$ and $\beta_2$ in the model (4.6) have weights
$\frac{1}{4}(2-\sqrt{2})$, $\frac{1}{4}(2+\sqrt{2})$ and $1-\sqrt{2}/2$, $\sqrt{2}/2$ at the points $0$, $\sqrt{2}|b|$, respectively.
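The weight formula of this example can be checked numerically. The sketch below assumes that (2.13) is the usual Elfving-type rule, that is, the weights are proportional to $|a_i|$ in the representation $c = a_1 f(s_1,b) + a_2 f(s_2,b)$; this reading of (2.13) and all function names are assumptions of the example. The code only compares the candidate two-point design with the closed-form weight and does not decide whether that candidate is c-optimal for a given $c$.

```python
import math

SQRT2 = math.sqrt(2.0)

def f(t, b):
    """Regression functions of model (4.6)."""
    return (1.0 / (t - b), 1.0 / (t - b) ** 2)

def weights_from_support(c, b):
    """Weights proportional to |a_i|, where c = a1*f(s1) + a2*f(s2)."""
    f1, f2 = f(0.0, b), f(SQRT2 * abs(b), b)
    det = f1[0] * f2[1] - f2[0] * f1[1]
    a1 = (c[0] * f2[1] - f2[0] * c[1]) / det
    a2 = (f1[0] * c[1] - c[0] * f1[1]) / det
    total = abs(a1) + abs(a2)
    return abs(a1) / total, abs(a2) / total

def omega1_closed_form(c, b):
    """Closed-form mass at t = 0 (the common factor |b| is cancelled)."""
    num = abs(SQRT2 * c[0] + (2.0 + SQRT2) * c[1] * b)
    return num / (num + (4.0 + 3.0 * SQRT2) * abs(c[0] + c[1] * b))

for c in [(1.0, 0.0), (0.0, 1.0), (1.0, -3.0)]:
    print(c, weights_from_support(c, -1.0)[0], omega1_closed_form(c, -1.0))
```

For $c = e_1$ and $c = e_2$ the two computations reproduce the weights $\frac{1}{4}(2-\sqrt{2})$ and $1-\sqrt{2}/2$ at the point $0$, independently of $b$.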

It follows from the results of Imhof and Studden (2001) that an E-optimal
design in the regression model (4.6) is given by the $c^*$-optimal design for the
Chebyshev vector $c^* = (1+\sqrt{2})|b|(-2, |b|(1+\sqrt{2}))^T$, which has masses $w_1$
and $1 - w_1$ at the points $0$ and $\sqrt{2}|b|$, respectively, where

$w_1 = \dfrac{1}{2}\,\dfrac{(2-\sqrt{2})(6-4\sqrt{2}+b^2)}{b^2+12-8\sqrt{2}}, \qquad 1 - w_1 = \dfrac{1}{2}\,\dfrac{\sqrt{2}(2\sqrt{2}-2+b^2)}{b^2+12-8\sqrt{2}}.$

Alternatively, the E-optimal design could also be obtained by the geometric
method of Dette and Haines (1994), which is especially designed for models
with two parameters.
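A short numerical check of these expressions is sketched below. It verifies that the Chebyshev vector $c^*$ produces a function $c^{*T}f(t,b)$ that equi-oscillates between $+1$ at $t = 0$ and $-1$ at $t = \sqrt{2}|b|$ and stays within $[-1,1]$ on a grid, and that the weights belonging to $c^*$ agree with the closed form for $w_1$. The grid, the weight rule (weights proportional to $|a_i|$ in $c^* = a_1 f(s_1,b) + a_2 f(s_2,b)$) and the function names are assumptions of this example.

```python
import math

SQRT2 = math.sqrt(2.0)

def f(t, b):
    """Regression functions of model (4.6)."""
    return (1.0 / (t - b), 1.0 / (t - b) ** 2)

def chebyshev_vector(b):
    """c* = (1 + sqrt 2)|b| (-2, |b|(1 + sqrt 2))^T."""
    k = (1.0 + SQRT2) * abs(b)
    return (-2.0 * k, k * abs(b) * (1.0 + SQRT2))

def cheb_poly(t, b):
    """Value of c*^T f(t, b)."""
    c, ft = chebyshev_vector(b), f(t, b)
    return c[0] * ft[0] + c[1] * ft[1]

def w1_closed_form(b):
    """Closed-form mass of the E-optimal design at t = 0."""
    return 0.5 * (2.0 - SQRT2) * (6.0 - 4.0 * SQRT2 + b * b) / (b * b + 12.0 - 8.0 * SQRT2)

def w1_from_chebyshev_vector(b):
    """Mass at t = 0 from the representation c* = a1*f(s1) + a2*f(s2)."""
    c = chebyshev_vector(b)
    f1, f2 = f(0.0, b), f(SQRT2 * abs(b), b)
    det = f1[0] * f2[1] - f2[0] * f1[1]
    a1 = (c[0] * f2[1] - f2[0] * c[1]) / det
    a2 = (f1[0] * c[1] - c[0] * f1[1]) / det
    return abs(a1) / (abs(a1) + abs(a2))

for b in (-1.0, -2.0):
    print(b, cheb_poly(0.0, b), cheb_poly(SQRT2 * abs(b), b),
          w1_closed_form(b), w1_from_chebyshev_vector(b))
```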
Fig. 1. Efficiencies of the E-optimal design $\xi_E^*(b)$ for estimating the individual
coefficients in the regression model (4.6) for various values of $b \in [-2.5,-1]$. Solid line:
$\mathrm{eff}_1(\xi_E^*(b))$, dotted line: $\mathrm{eff}_2(\xi_E^*(b))$.

In Figure 1 we show the efficiencies of the E-optimal design for estimating
the coefficients $\beta_1$ and $\beta_2$ in the regression model (4.6), that is,

V ) 

(4 7) 

^ ^ r 28(5^ (5^/2 - 7) + 6^(34^/2 - 48) + 396 - 280^2) ^ 

^1 (9\/2 - 11)(62-8^/2+ 12)(762 +16^/2-20) ’ ’ 

I 6^(V2 - 1) + (6^/2 - 8 ) 6 ^ + 68 - 48\/2 if -2 

I (^ 2 - 1 )( 62 - 8 ^ 2 + 12 )( 62 - 6 ^/ 2 + 8 )’ 

[for technical details for this calculation see Dette, Melas and Pepelyshev
(2002)]. We observe for the $e_1$-efficiency for all $b \le -1$ the inequality

$0.9061 \approx \lim_{b\to-\infty} \mathrm{eff}_1(\xi_E^*(b)) \le \mathrm{eff}_1(\xi_E^*(b)) \le \mathrm{eff}_1(\xi_E^*(-1)) \approx 0.9595,$

and similarly for the $e_2$-efficiency

$0.9805 \approx \mathrm{eff}_2(\xi_E^*(-1)) \le \mathrm{eff}_2(\xi_E^*(b)) \le \lim_{b\to-\infty} \mathrm{eff}_2(\xi_E^*(b)) = 1.$

This demonstrates that the E-optimal design yields very accurate estimates
for the individual parameters in the regression model (4.6).
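The numerical bounds quoted above can be reproduced with a few lines of Python. The sketch assumes that the efficiency is the ratio of the minimal variance for estimating $\beta_i$ (attained by the two-point $e_i$-optimal design on the Chebyshev points) to the variance under the E-optimal design. It uses the identity $c^T M^{-1}(\xi)c = a_1^2/w_1 + a_2^2/w_2$ for a two-point design with weights $w_1, w_2$ and $c = a_1 f(s_1,b) + a_2 f(s_2,b)$. The function names are hypothetical.

```python
import math

SQRT2 = math.sqrt(2.0)

def f(t, b):
    """Regression functions of model (4.6)."""
    return (1.0 / (t - b), 1.0 / (t - b) ** 2)

def e_optimal_w1(b):
    """Mass of the E-optimal design at t = 0 (closed form of Example 4.5)."""
    return 0.5 * (2.0 - SQRT2) * (6.0 - 4.0 * SQRT2 + b * b) / (b * b + 12.0 - 8.0 * SQRT2)

def efficiencies(b):
    """Return (eff_1, eff_2) of the E-optimal design for a given b < 0."""
    f1, f2 = f(0.0, b), f(SQRT2 * abs(b), b)
    det = f1[0] * f2[1] - f2[0] * f1[1]
    w1 = e_optimal_w1(b)
    w2 = 1.0 - w1
    result = []
    for c in ((1.0, 0.0), (0.0, 1.0)):
        a1 = (c[0] * f2[1] - f2[0] * c[1]) / det
        a2 = (f1[0] * c[1] - c[0] * f1[1]) / det
        best = (abs(a1) + abs(a2)) ** 2          # variance of the e_i-optimal design
        actual = a1 * a1 / w1 + a2 * a2 / w2     # variance under the E-optimal design
        result.append(best / actual)
    return tuple(result)

for b in (-1.0, -2.5, -1000.0):
    print(b, efficiencies(b))
```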

We finally mention the results for the local optimal design in the rational 
model (4.5), which maximize or minimize the corresponding functional for
the matrix $K_a^{-1} M(\xi, b) K_a^{-1}$, where $K_a = \mathrm{diag}(1, -1/a)$. Obviously, the local
$e_1$- and $e_2$-optimal designs coincide with the corresponding designs in the equivalent
linear regression model (4.6). On the other hand, the c-optimal design for
the rational model (4.5) is obtained from the $\bar c$-optimal design $\xi_{\bar c}^*(b)$ for
the model (4.6) with $\bar c = K_a c = (c_1, -c_2/a)^T$. Similarly, the local E-optimal
design for the rational model (4.5) has masses $w_1^E$ and $1 - w_1^E$ at the points
$0$ and $\sqrt{2}|b|$, where the weights are given by

$w_1^E = \dfrac{2\sqrt{2}a^2 + (4+3\sqrt{2})b^2}{2\{4(1+\sqrt{2})a^2 + (7+5\sqrt{2})b^2\}}, \qquad 1 - w_1^E = \dfrac{(4+3\sqrt{2})(2a^2 + (1+\sqrt{2})b^2)}{2\{4(1+\sqrt{2})a^2 + (7+5\sqrt{2})b^2\}}.$

An investigation of the efficiencies for the E-optimal design in the rational
model (4.5) yields similar results as in the corresponding equivalent linear
regression model (4.6). For a broad range of parameter values $(a, b)$ the local
E-optimal designs in the rational model (4.5) are very efficient for estimating
the individual parameters.
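The weights of the local E-optimal design in the rational model (4.5) can be cross-checked numerically. The sketch below works with the gradient $(1/(t-b), a/(t-b)^2)$ of the rational model, whose Chebyshev vector is obtained from $c^*$ by dividing the second component by $a$, and takes the weights proportional to $|a_i|$ in the representation of this vector by the gradients at the support points $0$ and $\sqrt{2}|b|$. A sign change in the second component leaves the weights unchanged. The function names and the weight rule are assumptions of this example.

```python
import math

SQRT2 = math.sqrt(2.0)

def w1_rational(a, b):
    """Closed-form mass at t = 0 of the local E-optimal design in model (4.5)."""
    num = 2.0 * SQRT2 * a * a + (4.0 + 3.0 * SQRT2) * b * b
    den = 2.0 * (4.0 * (1.0 + SQRT2) * a * a + (7.0 + 5.0 * SQRT2) * b * b)
    return num / den

def w2_rational(a, b):
    """Closed-form mass at t = sqrt(2)|b|."""
    num = (4.0 + 3.0 * SQRT2) * (2.0 * a * a + (1.0 + SQRT2) * b * b)
    den = 2.0 * (4.0 * (1.0 + SQRT2) * a * a + (7.0 + 5.0 * SQRT2) * b * b)
    return num / den

def w1_direct(a, b):
    """Mass at t = 0 computed from the Chebyshev vector of the gradient system."""
    B = abs(b)
    k = (1.0 + SQRT2) * B
    c = (-2.0 * k, k * B * (1.0 + SQRT2) / a)   # requires a != 0
    g = lambda t: (1.0 / (t - b), a / (t - b) ** 2)
    g1, g2 = g(0.0), g(SQRT2 * B)
    det = g1[0] * g2[1] - g2[0] * g1[1]
    a1 = (c[0] * g2[1] - g2[0] * c[1]) / det
    a2 = (g1[0] * c[1] - c[0] * g1[1]) / det
    return abs(a1) / (abs(a1) + abs(a2))

for a, b in [(1.0, -1.0), (2.0, -1.0), (0.5, -2.5)]:
    print(a, b, w1_rational(a, b), w1_direct(a, b))
```

For $a = 1$ the closed form reduces to the weight $w_1$ of the E-optimal design in the linear model (4.6).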


Example 4.6. We now discuss E-optimal designs for the rational model

(4.8)   $Y = \dfrac{a_1}{t-b_1} + \dfrac{a_2}{t-b_2} + \varepsilon, \qquad t \in [0,\infty),$

where $b_1, b_2 < 0$; $|b_1 - b_2| > 0$ ($k = 2$, $s = 0$). The corresponding equivalent
linear regression model is given by

(4.9)   $Y = \dfrac{\beta_1}{t-b_1} + \dfrac{\beta_2}{(t-b_1)^2} + \dfrac{\beta_3}{t-b_2} + \dfrac{\beta_4}{(t-b_2)^2} + \varepsilon.$

The results of Section 3 show that for sufficiently close parameters $b_i$, the E-
and $e_j$-optimal designs are supported at the Chebyshev points and that the
$c^*$-optimal design is the unique E-optimal design. In this case the local optimal
designs cannot be found explicitly. Therefore, we used these designs for
any vector $(b_1, b_2)$ under consideration as candidates for the optimal designs.
In other words, we used the Chebyshev points as support points and calculated
the optimal weights from the formulas presented in Section 2 to obtain
candidates for the local optimal design. The optimality for a concrete choice
was finally verified by an application of the results in Section 2 (see the
discussion below). For the sake of brevity, we restrict ourselves to model (4.9),
which corresponds to the local optimal design problem for model (4.8) with
$(a_1, a_2) = (1, 1)$. In our comparison we will also include the E-optimal design
in the limiting model under assumption (3.1), that is,

(4.10)   $Y = \dfrac{\beta_1}{t-x} + \dfrac{\beta_2}{(t-x)^2} + \dfrac{\beta_3}{(t-x)^3} + \dfrac{\beta_4}{(t-x)^4} + \varepsilon,$


where the parameter $x$ is chosen as $x = (b_1 + b_2)/2$. Without loss of generality
we assume that $x = -1$, because in the general case the optimal designs can
be obtained by a simple scaling argument. The limiting optimal design was
obtained numerically and has masses 0.13, 0.26, 0.27, 0.34 at the points 0,
0.18, 1.08 and 7.9, respectively.

Table 1

E-optimal designs for linear regression model (4.9) on the interval $[0,\infty)$,
where $b_1 = -1 - z$, $b_2 = -1 + z$. These designs are E-optimal in the rational
model (4.8) for the initial parameter $a_1 = a_2 = 1$. Note that the smallest
support point of the E-optimal design ($t^*_{1E}$) is 0

z             0.1    0.2    0.3    0.4    0.5    0.6    0.7    0.8    0.9    0.95
$t^*_{2E}$    0.18   0.17   0.17   0.16   0.15   0.13   0.11   0.09   0.05   0.03
$t^*_{3E}$    1.08   1.06   1.03   0.99   0.94   0.87   0.77   0.65   0.47   0.34
$t^*_{4E}$    7.85   7.77   7.65   7.46   7.21   6.88   6.45   5.88   5.05   4.43
$w^*_{1E}$    0.13   0.13   0.13   0.13   0.12   0.10   0.08   0.07   0.05   0.03
$w^*_{2E}$    0.26   0.26   0.27   0.26   0.25   0.22   0.20   0.17   0.13   0.10
$w^*_{3E}$    0.27   0.27   0.28   0.28   0.28   0.28   0.28   0.28   0.28   0.28
$w^*_{4E}$    0.34   0.33   0.33   0.33   0.36   0.39   0.44   0.49   0.54   0.59

Theorem 3.2 shows that for sufficiently small $|b_1 - b_2|$, E-optimal designs
for the model (4.9) are given by the design $\xi^*_{c^*}(b)$ defined in (2.9) and (2.11).
From Lemma 2.2 it follows that the design $\xi^*_{c^*}(b)$ is E-optimal whenever

$\lambda_{c^*} \le \lambda_{(2)}(M(\xi^*_{c^*}(b), b)) = \lambda_{(2)},$

where $\lambda_{c^*} = 1/\|c^*\|_2^2$ is the eigenvalue of the matrix $M(\xi^*_{c^*}(b), b)$ corresponding
to the eigenvector $c^*$ and $\lambda_{\min}(M(\xi^*_{c^*}(b), b)) \le \lambda_{(2)} \le \dots \le \lambda_{(m)}$ denote the ordered
eigenvalues of the matrix $M(\xi^*_{c^*}(b), b)$. The ratio $\lambda_{(2)}/\lambda_{c^*}$ is depicted in
Figure 2 for $b_1 = -1$ and a broad range of $b_2$ values, which shows that it is always
bigger than 1. Other cases yield a similar picture and, in practice, the local
E-optimal design for the rational model (4.8) and the equivalent linear
regression model (4.9) is always supported at the Chebyshev points and given
by (2.9) and (2.11).

Fig. 2. The ratio $\lambda_{(2)}/\lambda_{c^*}$ for the design $\xi^*_{c^*}(b)$, where $b = (-1, b_2)$. The designs are
E-optimal if this ratio is greater than or equal to 1.

In Tables 1 and 2 we give the main characteristics and
efficiencies for the local E-optimal design $\xi_E^*(b)$ and for the E-optimal design
$\bar\xi_E^*(x)$ in the limiting regression model (4.10). The efficiencies are defined
by (4.7) and we observe again very good performance of the E-optimal
designs. The behavior of the E-optimal design of the limiting regression model
(4.10), when it is used in model (4.9), is interesting from a practical point of
view because it is very similar to the performance of the E-optimal design
for a broad range of $b_1$ and $b_2$ values. Consequently, this design might be
appropriate if rather imprecise prior information for the nonlinear parameters
is available. For example, if it is known (from scientific considerations) that
$b_1 \in [\underline{b}_1, \bar b_1]$, $b_2 \in [\underline{b}_2, \bar b_2]$, the design
$\bar\xi_E^*((\underline{b}_1 + \bar b_1 + \underline{b}_2 + \bar b_2)/4)$ might be a robust choice for practical experiments.

Table 2

The efficiency (4.7) of the E-optimal designs $\xi_E^*(b)$ in the linear regression
model (4.9) on the interval $[0,\infty)$ with $b_1 = -1 - z$, $b_2 = -1 + z$ and the efficiency
of the E-optimal design $\bar\xi_E^*(-1)$ in the corresponding limiting model (4.10)

z                                   0.1    0.2    0.3    0.4    0.5    0.6    0.7    0.8    0.9    0.95
$\mathrm{eff}_1(\xi_E^*(b))$        1.00   1.00   0.99   0.94   0.70   0.45   0.50   0.55   0.64   0.78
$\mathrm{eff}_2(\xi_E^*(b))$        0.99   0.99   0.98   0.98   0.99   1.00   1.00   1.00   1.00   1.00
$\mathrm{eff}_3(\xi_E^*(b))$        1.00   1.00   1.00   0.99   0.95   0.87   0.76   0.68   0.58   0.44
$\mathrm{eff}_4(\xi_E^*(b))$        1.00   0.99   0.98   0.94   0.87   0.76   0.62   0.54   0.44   0.31
$\mathrm{eff}_1(\bar\xi_E^*(-1))$   1.00   0.99   0.98   0.94   0.79   0.61   0.39   0.32   0.29   0.27
$\mathrm{eff}_2(\bar\xi_E^*(-1))$   0.99   0.97   0.94   0.88   0.78   0.65   0.49   0.40   0.31   0.21
$\mathrm{eff}_3(\bar\xi_E^*(-1))$   1.00   0.99   0.98   0.95   0.88   0.75   0.54   0.40   0.24   0.08
$\mathrm{eff}_4(\bar\xi_E^*(-1))$   1.00   0.99   0.98   0.95   0.90   0.78   0.57   0.41   0.24   0.07

APPENDIX A: PROOFS

Proof of Lemma 2.1. Let $\xi_E^*$ denote an E-optimal design such that
the minimum eigenvalue $\lambda = \lambda_{\min}(M(\xi_E^*, b))$ of the information matrix $M(\xi_E^*, b)$
has multiplicity one with corresponding eigenvector $z \in \mathbb{R}^m$. By the equivalence
theorem for the E-optimality criterion [see Pukelsheim (1993), pages
181 and 182], we obtain for the matrix $E = zz^T/\lambda$,

$\dfrac{1}{\lambda}(z^T f(t))^2 = f^T(t) E f(t) \le 1$

for all $t \in I$ with equality at the support points of $\xi_E^*$. Because the Chebyshev
polynomial is unique it follows that (up to the factor $\mp 1$) $c^* = z/\sqrt{\lambda}$
and that $\mathrm{supp}(\xi_E^*) = \{s_1,\dots,s_m\}$. Now Theorem 3.2 in Dette and Studden
(1993) implies that $\xi_E^*$ is also $c^*$-optimal, where $c^* \in \mathbb{R}^m$ denotes the vector
of coefficients of the Chebyshev polynomial. Consequently, by the discussion
of the previous paragraph we have $\xi_E^* = \xi^*_{c^*}$, which proves the assertion. □

Proof of Lemma 2.2. From the identity (2.10) and the Chebyshev
property (2.7) it follows immediately that $c^*$ is an eigenvector of the matrix

$M(\xi^*_{c^*}, b) = \sum_{i=1}^{m} f(s_i) f^T(s_i) w_i$

with corresponding eigenvalue $\lambda = 1/\|c^*\|_2^2$. Now if $\lambda = \lambda_{\min}(M(\xi^*_{c^*}, b))$, we
define the matrix $E = \lambda c^* c^{*T}$ and obtain from the Chebyshev properties (2.7)
that

$f^T(t) E f(t) = \lambda (c^{*T} f(t))^2 \le \lambda = \lambda_{\min}(M(\xi^*_{c^*}, b))$

for all $t \in I$. The assertion of Lemma 2.2 now follows from the equivalence
theorem for E-optimality [see Pukelsheim (1993)]. □
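The eigenvector property used in this proof can be illustrated with the two-parameter model (4.6) of Example 4.5. The sketch below builds the information matrix of the two-point design with the closed-form weights given there and checks that $c^*$ is an eigenvector with eigenvalue $1/\|c^*\|_2^2$, and that this eigenvalue is the smaller of the two. The function names are assumptions of this example.

```python
import math

SQRT2 = math.sqrt(2.0)

def f(t, b):
    """Regression functions of model (4.6)."""
    return (1.0 / (t - b), 1.0 / (t - b) ** 2)

def chebyshev_vector(b):
    """c* = (1 + sqrt 2)|b| (-2, |b|(1 + sqrt 2))^T."""
    k = (1.0 + SQRT2) * abs(b)
    return (-2.0 * k, k * abs(b) * (1.0 + SQRT2))

def information_matrix(b):
    """M = w1 f(s1) f(s1)^T + w2 f(s2) f(s2)^T for the design of Example 4.5."""
    w1 = 0.5 * (2.0 - SQRT2) * (6.0 - 4.0 * SQRT2 + b * b) / (b * b + 12.0 - 8.0 * SQRT2)
    w2 = 1.0 - w1
    f1, f2 = f(0.0, b), f(SQRT2 * abs(b), b)
    return [[w1 * f1[i] * f1[j] + w2 * f2[i] * f2[j] for j in range(2)]
            for i in range(2)]

def check_eigenpair(b):
    """Return (M c*, lambda c*, lambda, other eigenvalue)."""
    m = information_matrix(b)
    c = chebyshev_vector(b)
    lam = 1.0 / (c[0] ** 2 + c[1] ** 2)
    mc = (m[0][0] * c[0] + m[0][1] * c[1], m[1][0] * c[0] + m[1][1] * c[1])
    other = m[0][0] + m[1][1] - lam      # trace minus the known eigenvalue
    return mc, (lam * c[0], lam * c[1]), lam, other

print(check_eigenpair(-1.0))
```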

Proof of Lemma 2.3. If $f_1,\dots,f_m$ generate a weak Chebyshev system
on the interval $I$, it follows from Theorem 2.1 in Studden (1968) that the
design $\xi^*_{e_j}$ defined in (2.12) and (2.13) is $e_j$-optimal if

$\varepsilon\, e_i^T F^{-1} e_j \ge 0, \qquad i = 1,\dots,m,$

for some $\varepsilon \in \{-1, 1\}$. The assertion of Lemma 2.3 is now obtained by Cramer's
rule. □

Proof of Lemma 3.1. Recall the definition of the functions in (3.3)
and let

(A.1)   $\bar M(\xi, x) = \int_I \bar f(t,x) \bar f^T(t,x)\, d\xi(t)$

denote the information matrix in the corresponding linear regression model.
Because of the Chebyshev property of the functions $\bar f_1,\dots,\bar f_{s+2k}$, we have
$|\bar M(\xi,x)| \neq 0$ (note that the design $\xi$ has at least $s + 2k$ support points). It
follows from Theorem B.1 that under the condition (3.1) with $\delta \to 0$, the
asymptotic expansion

(A.2)   $\delta^{4k-2} M^{-1}(\xi, b) = h\gamma\gamma^T + o(1)$

is valid, where the vector $\gamma = (\gamma_1,\dots,\gamma_{s+2k})^T$ is defined by

2 

%+2i-i = - H ^ i = 

(A.3) 

7i = --- = 7s = 0, 7s+2i = 0, i = l,...,k, 
and the constant $h$ is given by

(A.4)   $h = ((2k-1)!)^2 (\bar M^{-1}(\xi,x))_{m,m}$.

From (A.2) we obtain that the maximal eigenvalue of the matrix $M^{-1}(\xi,b)$
is simple if $\delta$ is sufficiently small.

For a fixed value $r = (r_1,\dots,r_k)$ and fixed $x \in \mathbb{R}$ in the representation (3.1),
denote by $\varepsilon = \varepsilon(x,r)$ the maximal value (possibly $\infty$) such that the matrix
$M^{-1}(\xi,b)$ has a simple maximal eigenvalue for all $\delta \le \varepsilon$. Then the function
$\varepsilon: (x,r) \to \varepsilon(x,r)$ is continuous and the infimum

$\inf\left\{ \varepsilon(x,r) \;\middle|\; x \in [c_1,d_1],\ \min_{i \neq j}|r_i - r_j| \ge \Delta,\ \|r\|_2 = 1 \right\}$

is attained for some $x^* \in [c_1,d_1]$ and $r^*$, which implies $\varepsilon^* = \varepsilon(x^*,r^*) > 0$. This
means that for any $b \in \Omega_{\varepsilon^*,\Delta}$, the multiplicity of the maximal eigenvalue of
the matrix $M^{-1}(\xi,b)$ is equal to one. □


Proof of Theorem 3.2. The proof is a direct consequence of Lemma
2.2 and Lemma 3.1, which shows that the maximum eigenvalue of the
inverse information matrix of any design has multiplicity one
if $b \in \Omega_{\varepsilon,\Delta}$ and $\varepsilon$ is sufficiently small. □

Proof of Theorem 3.5. It follows from Theorem 3.2 that the design
$\xi^*_{c^*} = \xi^*_{c^*}(b)$ is local E-optimal for sufficiently small $\delta > 0$. In other words, if $\delta$
is sufficiently small, the design $\xi^*_{c^*}$ minimizes $\max_{\|c\|_2=1} c^T M^{-1}(\xi,b)c$ in the
class of all designs. Note that the components of the vector $r = (r_1,\dots,r_k)$
are ordered, which implies

$e_{s+2i-1}^T\gamma \neq 0, \qquad i = 1,\dots,k.$

Multiplying (B.1) by $\delta^{4k-2}$, it then follows from Theorem B.1 that for some
subsequence $\delta_n \to 0$ we have $\xi^*_{c^*} \to \xi(x)$, where the design $\xi(x)$ minimizes the
function

$\max_{\|c\|_2=1} (c^T\gamma)^2\, e_m^T \bar M^{-1}(\xi,x) e_m$

and the vector $\gamma$ is defined by (A.3). The maximum is attained for $c = \gamma/\|\gamma\|_2$
(independently of the design $\xi$) and, consequently, $\xi(x)$ is $e_m$-optimal in
the linear regression model defined by the vector of regression functions
in (3.4). Now the functions $\bar f_1,\dots,\bar f_m$ generate a Chebyshev system and
the corresponding Chebyshev points are unique, which implies that the
$e_m$-optimal design $\xi(x)$ is unique. Consequently, every subsequence of designs
$\xi^*_{c^*}(b)$ contains a weakly convergent subsequence with limit $\xi(x)$ and this
proves the first part of the assertion. For a proof of the second part we note
that a c-optimal design minimizes $c^T M^{-1}(\xi,b)c$ in the class of all designs
on the interval $I$. Now if $c^T\gamma \neq 0$ and

E ^ 0 

for some i = the same argument as in the previous paragraph 

shows that (b) converges weakly to the design which maximizes the func¬ 
tion (7^c)2e^M (^,x)em- If ej_|_2j_i7 = 0 for all i = 1 ,... ,k, the condition 

c^7 / 0 implies some i = 1,..., A; and the assertion follows by 

multiplying (B.l) by and similar arguments. Finally, the third asser¬ 
tion follows directly from the definition of the vector 7 in (3.5). □ 


Proof of Lemma 4.1. Part (iii) follows from Remark 2.4. Parts (i) and (ii)
are proved similarly and we restrict ourselves to the first case. For this
purpose we introduce the functions $\psi(t,b) = (\psi_1(t,b),\dots,\psi_m(t,b))^T$ with

(A.5) _ ^ 

$\psi_{s+i}(t,b) = \dfrac{1}{t - b_i}, \qquad i = 1,\dots,2k,$

where $b = (b_1,\dots,b_{2k})^T$ is a fixed vector with $b_i \neq b_j$ if $i \neq j$. With the
notation


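$L(\Delta) = \begin{pmatrix} I_s & 0 \\ 0 & G_k(\Delta) \end{pmatrix}, \qquad G_k(\Delta) = \begin{pmatrix} G(\Delta) & & \\ & \ddots & \\ & & G(\Delta) \end{pmatrix} \in \mathbb{R}^{2k \times 2k}, \qquad G(\Delta) = \begin{pmatrix} 1 & 0 \\ -1/\Delta & 1/\Delta \end{pmatrix}$

(here $I_s$ is the $s \times s$ identity matrix) it is easy to verify that

(A.6)   $f(t,b) = L(\Delta)\psi(t, b_\Delta) + o(1),$

where $b_\Delta = (b_1, b_1 + \Delta, \dots, b_k, b_k + \Delta)^T$. For a fixed vector $T = (t_1,\dots,t_m)^T \in \mathbb{R}^m$
with ordered components $t_1 < \dots < t_m$ such that $t_i \in I$, $i = 1,\dots,m$,
define the matrices

$F(T,b) = (f_i(t_j,b))_{i,j=1}^{m}, \qquad \psi(T,b) = (\psi_i(t_j,b))_{i,j=1}^{m}.$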

Then we obtain from (A.6) 

detF(r,6) = a) 

(A.7) 

_ Yll<i<j<mi^j ~ bi) ni<i<j<fc(^i ~ bj) 

■ niinr=i(t,-6.)2 ’ 
where the last identity follows from the fact that $\psi(T,b)$ is a Cauchy-Vandermonde
matrix, which implies



Now for any $b \in B$, the right-hand side does not vanish and is of one
sign independently of $T$. Consequently, $\{f_i(t,b) \mid i = 1,\dots,m\}$ is a Chebyshev
system on the interval $I$. The assertion regarding the system $\{\bar f_i(t,x) \mid i =
1,\dots,m\}$ is proved similarly and, therefore, left to the reader. □

Proof of Theorem 4.2. The second part of the theorem is a direct 
consequence of Lemma 4.1 and Theorem 3.2, while the first part of the 
proposition follows by Theorem A.2 in Imhof and Studden (2001). □ 


APPENDIX B: AN AUXILIARY RESULT 


Recall the notation in Sections 2 and 3, the definition of the regression
functions in (2.1) and (3.3) and consider a design $\xi$ on the interval $I$ with at
least $m$ support points. In this appendix we investigate the relation between
the information matrices $M(\xi,b)$ and $\bar M(\xi,x)$ defined by (2.3) and (A.1),
respectively, if condition (3.1) is satisfied, where the components of the vector
$r = (r_1,\dots,r_k)$ are different and ordered.

Theorem B.l. Assume that ip G and ^ is an arbitrary design, 

such that the matrix $\bar M(\xi,x)$ is nonsingular. If assumption (3.1) is satisfied,
it follows that for sufficiently small $\delta$ the matrix $M(\xi,b)$ is invertible and


(B.l) M-(^,6) = r--T(.)f ^ + 

^ ^ ^ ^ ^ (0 77 ^^+ 0 ( 1 )/ 

where the matrices T{d) G and G G and 

are defined by 



2 k 




l2i = '[[{ri-rj) 2 



i = l,...,k, 
and the matrix $F \in \mathbb{R}^{2k \times 2k}$ is defined by

/O ••• 0 7i/0! 


F = 


Vo 


0 72fc/((2A:-l)!), 


Proof. Define $\delta_i = r_i\delta$, $i = 1,\dots,k$, $\psi(\delta) = (1, \delta, \dots, \delta^{2k-1})^T$ and
introduce the matrices


(B.2) 

(B.3) 


L = {h,...,^2kf 

C/ = diag ( 


n2kx2k 


1!’ 2!’'"’ (2fc- 1)! 


!,2kx2k 


where £ 2 %-! = £21 = i = 1,... ,k. For fixed t G I, we use the 

Taylor expansions 


j=o 


ip{t,x + 5) = ^ ^ ^), 


J! 




ip'{t,x + 5)= V.- ^ + o((5^^ ^), 


to obtain the representation 


(B.4) 


f{t,b + 5r) = 


0 






0 LU, 

where Ig G denotes the identity matrix and the vector / is of order 

(B.5) f{t) = ^ 0(6^^-^) f. 

It follows from pages 127-129 in Karlin and Studden (1966) that $\det L =
\prod_{1 \le i < j \le k}(\delta_i - \delta_j)^4$ and consequently, $V = (v_1,\dots,v_{2k}) := L^{-1}$ exists. The
equality $LV = I_{2k}$ implies the equations

= 0 , v'^i'fi'idj) =0, j / h 

which shows that $\delta_1,\dots,\delta_{i-1},\delta_{i+1},\dots,\delta_k$ are zeros of multiplicity two of
the polynomial $v_{2i}^T\psi(\delta)$ and $\delta_i$ is a zero of multiplicity one. Because this
polynomial has degree $2k - 1$, it follows that


(B.6) 


'^2i 


V'(-5) = (<5--50n 
and a similar argument shows that
T / /^ 


(B.7) 




S>i CXi 




j¥=i 


Si - S. 


where the constants ai,..., are given by 

(B.8) = , 

From (B.4) and (B.5) we therefore obtain 
f{t,b + 5r)f'^{t,b + Sr) 


i = I,... ,k. 


L 


0 


0 LU 




Is 0 y 

0 LUJ 


+ o{5 


2k-2\ 


and integrating the right-hand side with respect to the design $\xi$ shows that


Is 0 
0 LU 


M{i,x) 


(B.9) M(^,6 + <5r) = 

Now define IIi{5) = diag((f^^“^, 
H{5) = 


Is 0 \ 
0 LU 


\ +o((52fc-2) 


j;^2/c—1 x2/c—2^ 0 j^2fcx2/c 




and 


Is 0 
0 

Then we obtain from (B.6) and (B.7) that $H_1(\delta)(L^{-1})^T = (0|\gamma) + o(1)$, where
$\gamma = (\gamma_1,\dots,\gamma_{2k})^T$ is defined by formula (B.2) and $0 \in \mathbb{R}^{2k \times (2k-1)}$ denotes the
matrix with all entries equal to zero. This implies that the inverse of the
matrix $M(\xi, b + \delta r)$ is given by


M-\C,b + Sr)=H-\6) 

_ ^—Ak-\-4: 


I 0 
0 F 


M \^,x) 


I 0 

0 f'^ 


:t ] + o{l) \ H-\6) 




\ V (0 FM^^^ {^)F^, 


+ oil)\T{6), 


where the matrix F is defined by T = (0|7)C/ ^ G ]g2fcx2fc^ assertion now 
follows by a straightforward calculation which shows that FM^ {^)F'^ = 

/i77^. 

□ 